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For a global understanding of the physiological impact of the nuclear hormone 
1a,25-dihydroxyvitamin D3 (1 ,25(OH)2D3) the analysis of the genome-wide locations 
of its high affinity receptor, the transcription factor vitamin D receptor (VDR), is 
essential. Chromatin immunoprecipitation sequencing (ChlP-seq) in GM10855 and 
GM10861 lymphoblastoid cells, undifferentiated and lipopolysaccharide-differentiated 
THP-1 monocytes, LSI 80 colorectal cancer cells and LX2 hepatic stellate cells revealed 
between 1000 and 13,000VDR-specific genomic binding sites. The harmonized analysis 
of these ChlP-seq datasets indicates that the mechanistic basis for the action of the VDR 
is independent of the cell type. Formaldehyde-assisted isolation of regulatory elements 
sequencing (FAIRE-seq) data highlight accessible chromatin regions, which are under 
control of 1,25(OH)2D3. In addition, public data, such as from the ENCODE project, 
allow to relate the genome-wide actions of VDR and 1,25(OH)2D3 to those of other 
proteins within the nucleus. For example, locations of the insulator protein CTCF suggest 
a segregation of the human genome into chromatin domains, of which more than 1000 
contain at least one VDR binding site. The integration of all these genome-wide data 
facilitates the identification of the most important VDR binding sites and associated 
primary 1,25(OH)2D3 target genes. Expression changes of these key genes can serve 
as biomarkers for the actions of vitamin D3 and its metabolites in different tissues and 
cell types of human individuals. Analysis of primary tissues obtained from vitamin D3 
intervention studies using such markers indicated a large inter-individual variation for the 
efficiency of vitamin D3 supplementation. In conclusion, a genome-wide (over)view on 
the genomic locations of VDR provides a broader basis for addressing vitamin D's role in 
health and disease. 
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INTRODUCTION 

During evolution the secosteroid vitamin D3 became a pleiotropic 
signaling molecule (Jones et al., 1998). Initially, the molecule 
was used by early unicellular organisms to protect their DNA 
against UV-B irradiation (Holick, 2011). Far later, when the first 
fish with bones evolved, the endocrinology of vitamin D3 was 
established, and stiU is very conserved in all higher organisms, 
including humans (Bouillon and Suda, 2014). In this system, 
the energy of UV-B is used to convert 7-dehydrocholesterol into 



Abbreviations: l,25(OH)2D3, la,25-dihydroxyvitamin D3; 25{OH)D3, 25- 
hydroxyvitamin D3; ALOX5, arachidonate 5-lipoxygenase; CBS, cystathionine 
{i-synthase; CCNC, cyclin C; CDKNIA, cyclin-dependent kinase inhibitor lA; 
CHD7, chromodomain helicase DNA binding protein 7; ChlA-PET, chro- 
matin interaction analysis by paired-end tag sequencing; ChIP, chromatin 
immunoprecipitation; ChlP-seq, ChIP coupled with massive parallel sequencing; 
CTCF, CCCTC-binding factor; CYP, cytochrome P450; DNase-seq, DNase I 
hypersensitivity sites sequencing; DR3, direct repeat spaced by 3 nucleotides; 
FAIRE-seq, formaldehyde-assisted isolation of regulatory elements sequencing; 
IGV, Integrative Genomics Viewer; LPS, lipopolysaccharide; LRP5, low density 
lipoprotein receptor-related protein 5; MYC, v-myc avian myelocytomatosis viral 
oncogene homolog; PBMC, peripheral blood mononuclear cell; RXR, retinoid 
X receptor; THBD, thrombomodulin; TNFSFll, tumor necrosis factor (ligand) 
superfamily, member 11; TRPV6, transient receptor potential cation channel, 
subfamily V, member 6; TSS, transcription start site; VDR, vitamin D receptor; 
ZMIZl, zinc finger, MIZ-type containing 1. 



pre-vitamin D3, i.e., UV-B became essential for the synthesis 
of vitamin D3 (Holick, 2004) (more details in the article by 
Reichrath et al. in this issue). The central importance of this 
step is emphasized by the step-wise depigmentation of human 
skin, when modern humans started to move out of Africa some 
100,000 years ago (Hochberg and Templeton, 2010). Two hydrox- 
ylation steps are necessary for the conversion of vitamin D3 via 
25-hydroxyvitamin D3 (25(OH)D3) into the biologically active 
vitamin D3 metabolite, l,25(OH)2D3 (Norman, 2008). The latter 
molecule participates in a large number of physiological pro- 
cesses, such as bone formation, immune function and cellular 
growth and differentiation (Deluca, 2004) (more details in the 
articles by van Leeuwen et al., Hewison et al. and Munoz et al. 
in this issue). 

The transcription factor VDR is the only high-affinity tar- 
get for l,25(OH)2D3 within the cell nucleus (Haussler et al, 
1997). VDR is one of approximately 1900 transcription factors, 
which are encoded by the human genome (Vaquerizas et al., 
2009). In addition, VDR is a member of the superfamily of 
nuclear receptors, most of which are specifically activated by 
lipophilic molecules in the size of cholesterol (Carlberg and 
Molnar, 2012). Its lipophilic allows l,25(OH)2D3 to pass through 
all biological membranes, i.e., gene regulation by vitamin D 
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does not involve additional signal transduction steps, as they 
are known for hydrophilic signaling molecules, such as peptide 
hormones, growth factors and cytokines. Moreover, VDR is rather 
ubiquitously expressed, i.e., most human tissues and cell types are 
responsive to l,25(OH)2D3 (Wang et al, 2012). 

VDR shares the main structural characteristics of nuclear 
receptors, which is a highly conserved DNA-binding domain and 
a structurally conserved ligand-binding domain (Mangelsdorf 
et al, 1995). VDR's DNA-binding domain specifically contacts the 
hexameric consensus sequence RGKTSA (R = A or G, K = G 
or T, S = C or G) within the major groove of genomic DNA 
(Shaffer and Gewirth, 2002). However, like most other transcrip- 
tion factors, VDR uses a partner DNA-binding protein, in order 
to bind efficiently to its target sites. More than 20 years ago, 
this heterodimeric partner turned out to be the nuclear recep- 
tor retinoid X receptor (RXR) (Sone et al., 1991; Carlberg et al, 
1993). Steric constraints of the dimerizing DNA-binding domains 
of VDR and RXR determine the optimal binding site of the VDR- 
RXR complex as a direct repeat of two hexameric nuclear receptor 
binding motifs spaced by three nucleotides (DR3) (Umesono 
et al, 1991; Shaffer and Gewirth, 2004). Within VDR's ligand- 
binding domain, a network of some 40 mostly non-polar amino 
acids forms a ligand-binding pocket, in which l,25(OH)2D3 
and its synthetic analogs are specifically fixed with high affin- 
ity (Molnar et al., 2006). This ligand binding process induces a 
conformational change to the surface of VDR's ligand-binding 
domain, which results in a significant change of VDR's protein- 
protein interaction profile: it transforms from a repressor to an 
activator (Moras and Gronemeyer, 1998; Carlberg and Campbell, 
2013) (more details on VDR structure in the article by Molnar in 
this issue). 

Taken together, vitamin D signaling primarily comprises the 
molecular actions of the VDR, i.e., the physiological effects 
of l,25(OH)2D3 are largely identical to those of its recep- 
tor. This reduces vitamin D signaling to one central ques- 
tion: which are the most important genomic targets of VDR 
in a given tissue and which genes are controlled via these 
sites? Thus, this review focuses on the description of the 
genome-wide binding of VDR and its mechanistic implica- 
tions. This analysis will be in the context of genome-wide 
information on chromatin accessibility and the presence of 
other nuclear proteins, such as provided by the ENCODE 
consortium. 

GENOME-WIDE VDR BINDING 

The method chromatin immunoprecipitation (ChIP) was devel- 
oped, in order to monitor the binding of transcription factors to 
their genomic targets (Orlando, 2000). The core of the method is 
(i) mild chemical cross-linking of living cells or tissues, e.g., with 
1% formaldehyde, in order to fix nuclear proteins to genomic 
DNA, (ii) sonication of chromatin into small (200-400 bp) frag- 
ments, and (iii) immunoprecipitation with an antibody specific 
for the chosen nuclear protein (Maston et al., 2012). In this 
way, chromatin regions, which, at the moment of cross-linking, 
had been in contact with the protein of choice, are specifically 
enriched. A specific ChIP signal, in reference to a control (often 
unspecific IgGs), is a strong indication that the protein of choice 



had been in contact with the selected genomic region at the 
moment of cross-linking. 

At earlier times, the isolated chromatin template was analyzed 
by site-specific quantitative PGR (ChlP-qPCR). This approach 
had been used to study, for example, the extended promoter 
regions of the primary VDR target genes CYP24A1 (Vaisanen 
et al., 2005), CYP27B1 (Turunen et al., 2007), CCNC (Sinkkonen 
et al, 2005), and CDKNIA (Saramaki et al, 2006, 2009). 
Alternatively, the abundance of immunoprecipitated chromatin 
fragments had been detected by tUed microarrays (so-called 
"chips,") which covered a selection of promoter and enhancer 
regions or any other subset of the genome (ChlP-chip) . The group 
of Pike et al. had extensively used ChlP-chip, in order to locate 
VDR binding sites within the regulatory regions of the mouse 
genes Vdr (Zella et al, 2006), Trpv6 (Meyer et al, 2006), Lrp5 
(Fretz et al., 2007), Tnfsfll (also known as Rankl) (Kim et al., 
2006), Cyp24al (Meyer et al, 2010), and Cbs (Ki'iebitzsch et al, 

2011) . The latest development of the ChIP method is the unbi- 
ased analysis of the precipitated chromatin by massively parallel 
DNA sequencing (ChlP-seq), i.e., the detection of the binding 
sites of the transcription factor of choice in the complete genome. 
To date, ChlP-qPCR is primarily used for the confirmation of 
ChlP-seq results, while ChlP-chip got outdated shortly after its 
introduction. This leaves, at present, ChlP-seq as the method of 
choice for analyzing VDR's genomic binding loci. 

At present, the readouts of massive parallel sequencing are 
small sequence tags (35-50 nucleotides), but in the future there 
will be in majority longer reads used, which will lead to improved 
significance of the results. These sequence tags are aligned to a ref- 
erence genome (for human samples this is, at present, hgl9) and 
specifically represent the enriched chromatin fragments. Then 
"peak calling" software is used to identify genomic regions, in 
which significantly more sequence tags are detected than in con- 
trol reactions. Therefore, tags that accumulate as "peaks" at spe- 
cific genomic loci mark the presence of the investigated nuclear 
protein (Park, 2009; Furey, 2012). At present, ChIP is still per- 
formed with millions of cells; in case of a prominent binding site, 
most of these cells contribute to the ChIP signal, i.e., it can be 
assumed that in the majority of cells the locus is occupied by 
VDR. However, when only in some cells a site is bound by VDR, 
the respective peak is far less prominent, i.e., most likely of less 
impact for the regulation of l,25(OH)2D3 target genes. 

To date, VDR ChlP-seq data are available from (i) the 
immortalized lymphoblastoid cell lines GM10855 and GM10861 
(Ramagopalan et al., 2010), (ii) undifferentiated THP-1 
monocyte-like cells (Heikkinen et al, 201 1), (iii) lipopolysaccha- 
ride (LPS)-polarized THP-1 macrophage-like cells (Tuoresmaki 
et al., 2014), (iv) LS180 colorectal cancer cells (Meyer et al., 

2012) , and (v) LX2 hepatic stellate cells (Ding et al, 2013). The 
original publications reported between 1600 and 6200 VDR 
binding sites (in ligand-stimulated samples) within the human 
genome. However, these numbers are not directly comparable, 
since different peak calling software, alternative threshold settings 
and even an older version of the reference genome (hgl8) were 
used. A harmonized re-analysis of all six VDR ChlP-seq datasets 
with identical peak calling settings (MACS, version 2) resulted 
for l,25(OH)2D3-stimulated and unstimulated cells, respectively. 
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in following number of binding sites: 6172 and 3144 (GM10855), 
12,353 and 4072 (GM10861), 774 and 609 (undifferentiated 
THP-1), 953 and 529 (LPS-differentiated THP-1), 3777 and 165 
(LS180) and 1532 and 1474 (LX2) (Tuoresmaki et al, 2014). 

In total, the six VDR ChlP-seq datasets indicated 21,776 non- 
overlapping VDR binding sites when allowing a distance of up 
to 250 bp between the peak summits (Tuoresmaki et al., 2014). 
However, the vast majority of these VDR loci (67%) are unique 
for one of the analyzed cellular models. In contrast, under the 
above mentioned conditions only 54 sites are common within all 
six datasets. In general, this indicates that VDR displays a very 
individual pattern of cell-specific genomic locations, which over- 
laps between multiple tissues only at key sites. The VDR binding 
site of the l,25(OH)2D3 target gene ZMIZl, which is located 
15.3 kb downstream of the transcription start site (TSS), repre- 
sents an example of such a locus (Figure 1). In general, the rates 
of overlaps between the cell types follow roughly their develop- 
mental and functional relatedness, i.e., the two lymphoblastoid 
cell lines, GM10855 and GM10861, or LPS-differentiated and 
undifferentiated THP-1 cells show more overlapping VDR bind- 
ing sites than all other comparisons between the VDR ChlP-seq 
datasets. Moreover, the VDR binding profiles of ligand- stimulated 
cells matched better than those of unstimulated cells (Tuoresmaki 
etal, 2014). 

Genome-wide studies on VDR binding have changed the view 
on vitamin D signaling. The few dozens rather well character- 
ized VDR binding sites in less than lOkb distance to the TSS 
of l,25(OH)2D3 target genes (Haussler et al., 2013), which were 
known before, were complemented by thousands of additional 
VDR loci spread over the whole genome. However, the very most 
of the loci, which were highlighted by ChlP-seq, have not yet 
been validated by ChlP-qPCR or similar methods (and many 
will never be confirmed). Some previously known VDR bind- 
ing sites, such as those controlling the genes MYC (Toropainen 
et al, 2010), VDR (Zella et al, 2010), CCNC (Sinklconen et al, 
2005), and AL0X5 (Seuter et al, 2007), could be confirmed by the 
VDR ChlP-seq datasets. However, for many known l,25(OH)2D3 
target genes the ChlP-seq data suggest additional or alternative 
VDR binding sites, many of these being far more distant to the 
gene's TSS region than previously foreseen. In the past, many of 
these VDR binding sites had been overlooked due to a focus to 
only a few kb upstream of the TSS of l,25(OH)2D3 target genes. 
However, in accordance with the results of the ENCODE project 
(ENCODE-Project-Consortium et al, 2012), VDR binding sites 
are found with equal probability upstream and downstream of 
the TSS region of l,25(OH)2D3 target genes. In addition, VDR 
loci in distance of even more than 1 Mb from the gene's TSS are 
accepted as regulatory sites (more details below). 

In summary, there seem to be 1000-10,000 genomic VDR 
binding sites per cell type. This is far more than the number of pri- 
mary l,25(OH)2D3 target genes, which is in the order of 100-500 
per tissue. This even holds true for undifferentiated THP-1 cells, 
where 774 VDR loci in ligand-stimulated cells are facing 408 sta- 
tistically significantly up-regulated early l,25(OH)2D3 respond- 
ing genes (Heikkinen et al., 2011). The indicates that some genes 
are controlled by more than one VDR binding site, i.e., they may 
have a higher potential to be regulated by l,25(OH)2D3 than 



target genes with only one active VDR locus (more details on 
the transcriptome-wide response to l,25(OH)2D3 in the article 
by Campbell et al. in this issue). 

MECHANISTIC INSIGHT FROM VDR ChlP-SEQ STUDIES 

The close to 22,000 non-overlapping VDR peaks, which are indi- 
cated by the public ChlP-seq datasets (Tuoresmaki et al, 2014), 
show rather different characteristics. Despite the rather different 
total number of reported VDR peaks per cellular model, each of 
the six ChlP-seq datasets contains an in part overlapping sub- 
set of less than 200 sites, where a stimulation with l,25(OH)2D3 
resulted in a significant increase of VDR binding compared to 
unstimulated samples. These VDR loci are far more prominent 
than most of the other sites, for which ligand treatment was either 
repressive, had no effect or was only minor stimulatory. 

Another important parameter for the characterization of a 
VDR binding site is the presence or absence of a high confi- 
dence DR3-type binding site below the summit (±100 bp) of 
the respective ChlP-seq peak. This can be investigated with the 
help of binding site screening algorithms, such as provided by 
HOMER (Heinz et al, 2010). Depending on the threshold set- 
tings the software detects binding sites that deviate more or less 
from the consensus sequence. For example, for a moderate setting 
of a HOMER score of 7, from the total of 21,776 non-overlapping 
VDR sites in all six ChlP-seq datasets only 3801 (17.5%) contain 
a DR3-type sequence. Interestingly, the percentage of DR3-type 
motifs differs significantly between the datasets and ranges from 
38.2% (483 of 1264 sites) in LPS-polarized THP-1 cells via 36.4% 
(373 of 1023) in undifferentiated THP-1 cells, 28.6% (1062 of 
3706) in LS180 cells, 27.8% (611 of 2194) in LX2 cells, 13.0% (909 
of 6975) in GM10855 cells to 9.0% (1118 of 12,438) in GM10861 
cells. This indicates that the total number of identified VDR bind- 
ing sites in each cell line inversely correlates with the percentage 
of peak summits with DR3-type sites. However, when the analysis 
is restricted to the top 200 VDR sites (based on fold enrichment 
scoring), for all six ChlP-seq datasets a DR3-like sequence rate 
of more than 60% is observed, i.e., DR3 motifs are found pref- 
erentially at highly ligand responsive VDR loci. In this way, the 
different VDR ChlP-seq datasets show a very similar relationship 
between VDR occupancy and DR3 percentage. This suggests that 
the mechanistic basis for the action of the VDR is independent of 
the cell type and the total number of identified binding sites. 

Transcription factor binding site screening software, such as 
HOMER, suggests that DR3-type binding sequences are the most 
abundant sites below the summits of VDR ChlP-seq peaks. 
However, a significant number of the genomic VDR loci (depend- 
ing on the dataset 60-90% of all, see above) do not associate 
with a DR3-type site. This indicates that at these loci VDR uses 
a different mode of interaction with genomic DNA. This could 
be either the use of a different heterodimeric binding partner 
or an indirect binding "backpack" of a DNA-binding transcrip- 
tion factor (Carlberg and Campbell, 2013). In both scenarios 
the specific DNA binding site would be different to a DR3-type 
sequence. Interestingly, for the VDR ChlP-seq datasets originat- 
ing from hematopoietic cells, HOMER indicated binding sites for 
the transcription factors PU.l (also called SPIl), ESRRB (also 
called NR3B2) and GABPA as significantly enriched (Tuoresmaki 
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FIGURE 1 I Conserved genomic VDR binding in six cellular models. 

The Integrative Genomics Viewer (IGV) browser (Robinson at al., 
2011) was used to visualize the VDR binding site 15.3kb 
downstream of the ZMIZl TSS. The peak tracks display data from 
VDR ChlP-seq datasets from two B cell-like cells (dark and light 



blue), monocyte-like cells (red), macrophage-like cells (orange), colon 
cells (gray) and liver cells (violet). The cells were either unstimulated 
(— ) or treated with VDR ligand (-I-). The gene structures are shown 
in blue and the sequence of the DR3-type element below the 
summit of the VDR peak is indicated. 



et al, 2014). PU.l is well-known as a pioneer factor (Zaret and 
Carroll, 2011), i.e., as a transcription factor with (i) a high num- 
ber of genomic binding sites, (ii) a greater binding promiscuity 
and (iii) higher diversity of interactions. Pioneer factors are the 
first that bind regulatory genomic regions, such as promoters and 
enhancers, and interact with chromatin modifying enzymes, in 
order make the chromatin more accessible for regular transcrip- 
tion factors, such as VDR. At present, a direct protein-protein 
interaction of VDR with PU.l, ESRRB or GABPA has not been 
demonstrated, but for the Ets family, to which PU. 1 belongs, there 
were indications for an interaction (Tolon et al, 2000). However, 
for the pure function as a pioneer factor a direct protein-protein 
interaction with the "settler factor" is not needed. Moreover, there 
is older evidence from single gene studies that DNA binding of 
VDR is modulated by the transcription factors API (Schiile et al., 
1990) and RUNX2 (Sierra et al, 2003). In contrast, a genome- 
wide study on the interaction of VDR with the transcription 
factor TCF7L2 did not provide any evidence that the latter acts 
as a pioneer factor for VDR (Meyer et al., 2012). 

Below VDR peak summits no dominant non-DR3 binding 
sequence could be identified. Moreover, the six VDR ChlP-seq 
datasets differ in the ranking and identity of the non-DR3 sites 
found below the peaks (Tuoresmaki et al., 2014). This suggests 
that in total there must be a larger number of VDR partnering 



proteins. Most likely, these proteins have a cell-specific expression 
pattern and may explain in part the cell-specific actions of VDR 
and its natural ligand l,25(OH)2D3. Moreover, ChlP-seq datasets 
have indicated that, in contrast to steroid receptors, VDR binds 
a number of its genomic targets already in the absence of lig- 
and. These ligand-independent genomic VDR loci have a clearly 
lower rate of DR3-type sequences than ligand-dependent sites 
(Heildcinen et al., 2011). In contrast, they associate preferentially 
with proteins related to gene repression, such as demonstrated for 
the example of the CYP27B1 gene (Turunen et al, 2007). This 
implies that the functional profile of VDR is larger than that of 
its ligand (Polly et al, 2000) as previously shown for other mem- 
bers of the nuclear receptor superfamily, such as thyroid hormone 
receptor or liver X receptor (Perissi et al, 2010). 

Taken together, all VDR ChlP-seq studies confirm the pref- 
erential binding of VDR to DR3-type sequences. However, only 
one in slk of some 22,000 presently known VDR loci within the 
human genome carry a DR3 site. Thus, there have to be addi- 
tional mechanisms for the association of VDR with its genomic 
loci, which may include partnering with presently undefined 
partner proteins or the tethering to other DNA-binding tran- 
scription factors, such as pioneer factors. These should explain 
some of the ceU-specific actions as well as repressive functions of 
l,25(OH)2D3 and the VDR. 
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RESPONSES OF CHROMATIN TO 1,25(0H)2D3 

Histone proteins forming the nucleosome core are DNA- 
binding proteins but do not show any sequence specificity. 
Therefore, the complex of nucleosomes and genomic DNA, 
which is referred to as chromatin, has an intrinsic repres- 
sive potential: it prevents access of transcription factors to 
their genomic targets (Razin, 1998). This provides essential sta- 
bility to the epigenetic landscape for long-lasting regulatory 
decisions, such as gene expression in terminally differentiated 
cells (Mohn and Schubeler, 2009). In contrast, some regions 
of the epigenome show highly dynamic changes in response 
to extra- and intracellular signals, such as the activation of 
VDR by l,25(OH)2D3 binding (Talbert and Henikoff 2006). 
These changes involve the methylation of genomic DNA and/or 
reversible post-translational modifications of histone proteins, 
such as acetylation or deacetylation at exposed lysine residues 
(Narlikar et al., 2002). Dynamic chromatin modifications change 
the access to regulatory genomic regions, such as promoter 
and enhancers, for the binding of transcription factors, i.e., 
they determine whether at these regions chromatin is open or 
closed. This can be monitored genome-wide by using the method 
DNase I hypersensitive sites sequencing (DNase-seq), which high- 
lights genomic regions being most sensitive to cleavage by the 
enzyme DNase I (Crawford et al, 2006). A very similar tech- 
nique is Formaldehyde -Assisted Isolation of Regulatory Elements 
sequencing (FAIRE-seq), which identifies genome-wide accessi- 
ble DNA regions (Giresi et al, 2007) (more details on the relation 
of the epigenome and l,25(OH)2D3 in the article by Kallay et al. 
in this issue). 

At present, the only publically available dataset describing 
genome-wide effects of l,25(OH)2D3 on the epigenome, is a 
detailed FAIRE-seq time course in THP-1 cells (Seuter et al., 
2013). These data demonstrate that some 87% of the more than 
1000 VDR binding sites in this cellular model co-localize with 
open chromatin. Interestingly, at 165 of these VDR loci a strong 
l,25(OH)2D3-dependent increase of chromatin accessibility is 
found. Importantly, at 66% of these chromatin regions a DR3- 
type sequence is found, i.e., they overlap with loci, at which VDR 
binding is enhanced most by l,25(OH)2D3 stimulation (Seuter 
et al, 2013). Moreover, the binding of VDR to its genomic loci is 
a dynamic process, which takes at least some 2 h to saturate the 
sites. One example is a site located 225 kb downstream of the TSS 
of the chromodomain helicase DNA binding protein 7 {CHD7) 
gene (Figure 2). It demonstrates that at the same locus, where 
a strong ligand-dependent increase of VDR binding is observed, 
the rate of open chromatin more than doubled already 40 min 
after incubation of THP-1 cells with l,25(OH)2D3. At some 200 
additional VDR binding loci the chromatin shows detectable but 
less prominent response to l,25(OH)2D3 treatment, while at the 
remaining 500 sites the VDR ligand did not affect chromatin 
accessibility. Accordingly, only at less than 20% of the latter sites 
DR3-type sequences are found. At many of these sites, VDR binds 
already in the absence of ligand and may have a different mode of 
DNA recognition and action (see above). 

In summary, at approximately a third of its genome-wide 
binding loci VDR dynamically controls the epigenetic state of 
chromatin. At these sites, VDR binding and chromatin opening 



are tightly interconnected and provide indications for primary 
l,25(OH)2D3 target genes. This allows a better understanding of 
the l,25(OH)2D3 signaling cascade. 

THE USE OF ENCODE DATA FOR UNDERSTANDING 
1,25(0H)2D3 SIGNALING 

In addition to the above described l,25(OH)2D3-triggered chro- 
matin open, in the future there wiU be much more data avail- 
able on the interaction of VDR and l,25(OH)2D3 with the 
epigenome. This will include FAIRE-seq and DNase-seq studies 
in further l,25(OH)2D3-responsive tissues as well as investi- 
gations on changes of histone modifications and/or co-factor 
binding in l,25(OH)2D3-responsive genomic regions. However, 
already at present existing genome-wide data on the annota- 
tion of the genomes of human, mouse and other species can be 
used. The best example is the large range of data collected by 
the ENCODE consortium (ENCODE-Project-Consortium et al., 
2012). The core of the ENCODE datasets are publically avail- 
able ChlP-seq results on approximately 100 transcription factors 
and 20 histone modifications from more than 100 human cel- 
lular systems. From the latter, the human monocytic leukemia 
cell line K562 is represented with highest number of datasets, 
while the majority of the other cells has not been studied with 
the same intensity. At present, ENCODE data describe primarily 
the basal status of cells, i.e., only in a very few cases a stimulation 
with hormones, growth factor, cytokines or similar molecules 
had been performed. Neither data on l,25(OH)2D3 stimula- 
tions nor VDR ChlP-seq data are comprised in the ENCODE 
dataset. Nevertheless, the examples shown below will illustrate, 
how already on this stage ENCODE data are useful for a more 
detailed understanding of l,25(OH)2D3 signaling. 

All six VDR ChlP-seq datasets agree with observations of the 
ENCODE project that (i) transcription factors bind equally likely 
both up- and downstream of their target gene TSSs and (ii) the 
likelihood of detecting functional transcription factor binding 
sites for a given gene decreases by distance from its TSS region 
(ENCODE-Project-Consortium et al., 2012). This means that, in 
relation to the TSS of primary l,25(OH)2D3 target genes, the 
distribution of the VDR binding sites has a Gaussian shape. In 
turn, this suggest that on the same chromosome there would 
be no threshold distance for the interaction between a VDR 
binding locus and the TSS of a primary l,25(OH)2D3 target 
gene. However, there are limitations provided by higher-order 
structures of chromatin. 

Chromatin forms loops (Kadauke and Blobel, 2009), which 
contribute to many nuclear functions, such as the control of gene 
expression (Misteli, 2007). Chromatin loops segregate each chro- 
mosome into domains, which are separated by an insulator region 
(Van Bortle and Corces, 2013). Most insulator regions are asso- 
ciated with the highly conserved transcription factor CCCTC- 
binding factor (CTCF) (Schmidt et al., 2012). Therefore, ChlP- 
seq data for CTCF binding from multiple human cell lines, such 
as provided by ENCODE (ENCODE-Project-Consortium et al., 
2012), allow a first estimation of the chromatin domain borders 
(Figure 3). However, only 15-20% of all genomic CTCF binding 
sites are involved in insulator function. The method chromatin 
interaction analysis by paired-end tag sequencing (ChlA-PET) 
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FIGURE 2 I Genomic view of 1,25(OH)2D3-dependent chromatin 
opening. Tlie IGV browser visualizes the loci of a VDR locus 225 kb 
downstream of tfie CHD7 gene (±40 kb of the peak summit). The 
peak tracks display data from THP-1 cells: a time course of 
FAIRE-seq data [gray for EtOH-treated controls and turquoise for 



1,25(OH)2D3 (1,25D) treatments for the indicated time periods] and a 
VDR ChlP-seq data [red, from unstimulated cells and after 40 min 
1,25(OH)2D3 treatment]. The gene structures are shown in blue and 
the sequence of the DR3-type element below the summit of the 
VDR peak is indicated. 



(FuUwood et al., 2009) allows an assessment of the 3-dimensional 
structure of chromatin. When applied for CTCF in K562 cells it 
mapped more than 120,000 intra-chromosomal, CTCF-mediated 
chromatin interactions (ENCODE-Project-Consortium et al., 
2012). The high conservation of CTCF binding sites allows a reli- 
able extrapolation of the CTCF ChlA-PET data from K562 cells 
to THP-1 cells, for which VDR ChlP-seq data is available. The 
combination of both datasets suggests that in THP- 1 cells there 
are some 1600 chromatin domains, which contain at least one 
VDR binding site (Seuter et al, 2014). When the TSS region of 
a gene is within one of these chromatin regions, it may be a 
primary l,25(OH)2D3 target. In case of the CD14 gene, CTCF 
ChlA-PET data from K562 cells defined a chromatin domain 
spanning from 1.5 kb upstream to 57 kb downstream of the gene's 
TSS (Figures). This domain spans over the whole CD14 gene 
and comprises two VDR binding sites 24 and 26 kb downstream 
of the gene's TSS. This provides a straightforward gene regula- 
tory scenario explaining (i) the primary response of CD14 to 
l,25(OH)2D3 and (ii) why the neighboring genes of CD14 do not 
respond to VDR ligand treatment. 

The chromatin domain of the CD14 gene is with a size of less 
than 60 kb rather small (Figure 3). In contrast, one of the largest 
VDR-containing chromatin domains spans over 2.3 Mb of chro- 
mosome 8 and combines the MYC gene with four VDR binding 
sites, of which the most prominent is 1215 kb of the gene's TSS 
(Ryyniinen et al., 2013). This suggests that under the condition 
of sufficiently large chromatin domains, gene regulation by VDR 
can be over a distance of more than 1 Mb. 



At present, there is no VDR ChlA-PET data available but 
will come in the future. However, via the ENCODE exper- 
iment matrix (http://encodeproject.org/ENCODE/dataMatrix/ 
encodeDataMatrixHuman.html) there is access to ChlA-PET data 
for RNA polymerase II in HeLa human cervix carcinoma and 
MCF-7 human breast carcinoma and for estrogen receptor a in 
MCF-7 cells. The latter may be of special interests for the breast 
cancer field. 

Taken together, public ENCODE data are important tools, 
which can be used in combination with genome-wide data on 
VDR for an extrapolation on the 3-dimensional organization of 
gene regulation by l,25(OH)2D3. 

1,25(0H)2D3 TARGET GENES AS BIOMARKERS FOR THE 
VITAMIN D STATUS OF HUMAN INDIVIDUALS 

In contrast to a number of other nuclear receptor ligands, such as 
Cortisol or estrogen, the endocrinology of l,25(OH)2D3 does not 
imply any fast changes (Deluca, 2004; Norman, 2008). Under nor- 
mal circumstances, either the production in UV-B exposed skin 
or the intake of from diet or supplements should provide suffi- 
cient amounts of vitamin D3, in order to achieve optimal serum 
25(OH)D3 concentrations. The latter vitamin D metabolite is the 
widely accepted indicator of the vitamin D3 status of the human 
body (Hollis, 2005). The serum 25(OH)D3 concentrations change 
only in the order of weeks and months, such as the result of 
seasonal variations in sun exposure (Virtanen et al., 2011). This 
indicates that stimulation experiments with l,25(OH)2D3 over a 
few hours, as performed in in vitro experiments, do not represent 
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the physiological reality. In contrast, the effects of more long- 
lasting changes of serum 25(OH)D3 concentrations should be 
considered. On a genome-wide level, this was investigated first 
with primary T cells isolated from nine human individuals with 
variant serum 25(OH)D3 concentrations (Handel et al., 2013). 
The number of the observed VDR ChlP-seq peaks, which varied 
between 200 and more than 7000, correlated with the 25(OH)D3 
levels of the individuals, i.e., the higher the circulating 25(OH)D3 
concentrations, the more VDR loci were identified in T cells. 
Unfortunately, the raw data of this study is not available, i.e., a 
harmonized re-analysis in comparison with other published VDR 
ChlP-seq data cannot be performed. However, from the 14,044 
unique VDR peaks reported for the sum of the nine individuals, 
only 442 (3.1%) associated with a DR3-type sequence (based on 
HOMER score 7 settings). 

Serum 25(OH)D3 concentrations vary widely from person to 
person based on (i) varied diet and sun exposure, (ii) differ- 
ent age and/or level of adiposity and (iii) genetic and epigenetic 
variations (Engelman et al., 2008; Orton et al., 2008; Snellman 
et al., 2009). The Institute of Medicine recommends a serum 
25(OH)D3 level of 50 nM (Institute-of-Medicine, 2011), but it 
is under debate, whether this is sufficient for every individual 
(Holick, 2007). In fact, a substantial proportion of the world's 
population could be considered as vitamin D deficient. This con- 
dition may accelerate age-related bone loss and morbidity from 
falls and fractures. In addition, vitamin D insufficiency is asso- 
ciated with a number of diseases, such as cancer, autoimmune 
disorders and all components of the metabolic syndrome (more 
details in the article by Bendik et al. in this issue). 

This important medical problem guided to the question, 
whether an insight into the genome- and transcriptome-wide 



actions of VDR and l,25(OH)2D3 can help in a more accu- 
rate evaluation of the human individual's responsiveness to, and 
needs for, vitamin D. A first approach in this direction was 
done by studying peripheral blood mononuclear cells (PBMCs) 
and adipose tissue biopsies from 71 elderly, pre-diabetic indi- 
viduals, which participated in a 5-month vitamin D3 inter- 
vention trial (VitDmet) during Finnish winter (Carlberg et al., 
2013). The changes in the mRNA expression of the primary 
l,25(OH)2D3 target genes CDU and thrombomodulin (THBD), 
which had been identified in a recent comparative study as 
most reliable biomarkers (Standahl Olsen et al, 2013), in both 
PBMCs and fat samples were correlated with the alterations in 
the serum 25(OH)D3 levels of the 71 individuals. Interestingly, 
only for a subset of individuals significant correlations between 
the up-regulation of both genes and the intervention-induced 
raise in serum 25(OH)D3 concentrations were obtained. This 
suggests that, on a molecular level, not all study participants 
benefited from the vitamin D3 supplementation, because (i) 
they had already reached their individual optimal vitamin D 
status before the start of the intervention, (ii) they carry a 
genetic polymorphism making them less responsive to vita- 
min D3 or (iii) other undefined reasons (Carlberg et al., 2013). 
Interestingly, the categorization of the human individuals by 
their vitamin D responsiveness unmasked a negative correla- 
tion between changes in serum concentrations of 25(OH)D3 and 
the inflammation marker interleukin 6, i.e., the more respon- 
sive the study participants were to vitamin D3 supplementation, 
the lower was their inflammatory status. At present, a num- 
ber of other primary l,25(OH)2D3 target genes, which were 
highlighted in the comparison of VDR ChlP-seq data, are eval- 
uated for their potential to serve as even better biomarkers 
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for the vitamin D status of human individuals than CD14 
and THBD. 

In summary, vitamin D deficiency may negatively contribute 
to a number of diseases. Genome-wide insight led to the use 
of mRNA expression changes of the genes CD14 and THBD as 
biomarkers for a molecular evaluation of vitamin D3 supple- 
mentation studies. The results allow a classification of human 
individuals based on their responsiveness to vitamin D3 . 

CONCLUSIONS 

Genome-wide data on (i) the location of transcription factor 
binding sites in living cells, (ii) histone modifications and (iii) 
accessible chromatin, such as provided by ChlP-seq, DNase-seq 
and FAIRE-seq studies, have significantly changed the view on, 
and the understanding of the regulation of the entirety of the 
genes of our genome. This applies also to the transcription fac- 
tor VDR, for which at present ChlP-seq data from six human cell 
lines and the T cells of nine human individuals are available. The 
abovementioned modern genome-wide techniques allow a more 
unbiased identification of transcription factor binding sites com- 
pared to previous studies, which were mostly focused on regions 
a few kb upstream of a primary l,25(OH)2D3 target gene. VDR 
binding loci have now been shown to be localized equally likely 
up- and downstream of TSS regions in distances of even more 
than 1 Mb. 

Genome-wide studies have confirmed DR3-type sequences as 
the preferential binding sites for VDR (most likely as a het- 
erodimer with RXR), but only one in seven of the close to 20,000 
known VDR binding loci carry such a motif This is unanimously 
observed in all investigated cellular models. Therefore, there must 
be other types of binding motifs and partnering proteins that 
attract VDR to its genomic targets. These presently poorly under- 
stood alternative binding modes may explain some of VDR's 
function in the (trans)repression of its target genes. Moreover, 
the VDR cistrome seems to be largely cell-specific with only some 
50 loci overlapping in all investigated models. However, these 
conserved sites could be fundamental entry ports of VDR to 
the human genome, which may serve as the unified core of the 
various pleiotropic functions of l,25(OH)2D3. 

The 22,000 detected VDR binding loci within six cell lines 
as well as the 14,000 peaks found in primary T cells from nine 
human individuals may be far more than what is needed to 
control the physiological actions of l,25(OH)2D3, i.e., many 
sites may represent rather "noise" than having a specific func- 
tion. Therefore, different approaches to categorize VDR loci are 
useful. It turned out that VDR binding sites that (i) carry a 
DR3-type sequence, (ii) show ligand-stimulated VDR association, 
(iii) co-locate with ligand-induced chromatin opening and (iv) 
are conserved between several cellular systems may play a more 
important role in mediating the functions of l,25(OH)2D3 than 
the vast majority of other VDR sites that lack most of these prop- 
erties. Therefore, for a cellular system of interest, the combination 
of (i) a genome-wide assessment of open chromatin by DNase-seq 
or FAIRE-seq, (ii) the monitoring of genomic VDR loci by ChlP- 
seq and (iii) a screening for DR3-type sequences below the peak 
summits is an efficient tool for the prediction and identification 
of primary l,25(OH)2D3 target genes. 



FUTURE PERSPECTIVES 

Although historically l,25(OH)2D3 was understood to be a hor- 
mone controlling calcium homeostasis and bone formation, to 
date the genome-wide the actions of VDR are best understood 
and monitored in cells of the hematopoietic system. This empha- 
sizes the impact of l,25(OH)2D3 on the function of innate 
and adaptive immunity. There are first indications that the core 
actions of l,25(OH)2D3 and its receptor VDR can be extrapo- 
lated from hematopoietic cells to other tissues and cell types of the 
human body (Carlberg et al., 2013). If this holds true, the vitamin 
D status and responsiveness of a human individual can be derived 
from the response of, for example, PBMCs. Technically, various 
types of leukocytes can be collected far easier from blood samples 
than any other tissue biopsy. Like a glucose tolerance test is used 
to monitor the functionality of the carbohydrate metabolism of 
an individual, there may be in the future a higher dose vitamin 
D3 challenge test, where the (epi)genomic and transcriptomic 
profiles of leukocytes before and after supplementation are mea- 
sured. Routine measurements of healthy individuals may also in 
the future more likely be based on a few selected biomarkers, such 
as CD14 and THBD, while more complex scenarios in disease set- 
tings, such as cancer or autoimmune diseases, will be assessed 
genome- or transcriptome-wide. In the same way, basic research 
on l,25(OH)2D3 andVDRwUl shift more andmorefrom cell cul- 
ture models to primary tissues and cell types and will eventually 
reach the single cell level. 
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